Objective Prediction of Visual Saliency Maps in Egocentric Videos for Content-action Interpretation
نویسندگان
چکیده
Extraction of visual saliency from video is in the focus of intensive research nowadays due to the variety and importance of application areas. In this paper we study the relation between subjective saliency maps, recorded on the basis of gazetracker data in a new upcoming video content: the egocentric video recorded with wearable cameras. On the basis of physiological research and comparing the subjective maps of an Actor performing activities of everyday life and a Viewer who interprets the video after it has been recorded, we identify the temporal shift between these two saliency maps. Using this relation we propose an ”à la carte” prediction of saliency maps of an Actor for the beginning of actions by an objective saliency model we previously developed. All the components of objective saliency: spatial, temporal and central bias are merged in this prediction. The commonly used quality metrics for pixel-based saliency prediction such as Pearson Correlation Coefficient, Normalized Scan Path and Area Under Curve show the good correspondence of predicted maps for Actor and Viewer. This research seems to us promising for content interpretation coming from mobile video recording devices.
منابع مشابه
Compressed-Sampling-Based Image Saliency Detection in the Wavelet Domain
When watching natural scenes, an overwhelming amount of information is delivered to the Human Visual System (HVS). The optic nerve is estimated to receive around 108 bits of information a second. This large amount of information can’t be processed right away through our neural system. Visual attention mechanism enables HVS to spend neural resources efficiently, only on the selected parts of the...
متن کاملVisual saliency maps for studies of behavior of patients with neurodegenerative diseases: Observer’s versus Actor’s points of view
Finding the salient regions in videos has been a very active topic. In this work we compare the modelisation of visual attention on egocentric video recordings for two different points of view. We are interested in finding the relation between the visual saliency maps of the viewer of visual content and the actors (person executing the actions). This question is of importance because the buildi...
متن کاملCan Saliency Map Models Predict Human Egocentric Visual Attention?
The validity of using conventional saliency map models to predict human attention was investigated for video captured with an egocentric camera. Since conventional visual saliency models do not take into account visual motion caused by camera motion, high visual saliency may be erroneously assigned to regions that are not actually visually salient. To evaluate the validity of using saliency map...
متن کاملAttention Prediction in Egocentric Video Using Motion and Visual Saliency
We propose a method of predicting human egocentric visual attention using bottom-up visual saliency and egomotion information. Computational models of visual saliency are often employed to predict human attention; however, its mechanism and effectiveness have not been fully explored in egocentric vision. The purpose of our framework is to compute attention maps from an egocentric video that can...
متن کاملEgocentric vision IT technologies for Alzheimer disease assessment and studies
Egocentric vision technology consists in capturing the actions of persons from their own visual point of view using wearable camera sensors. We apply this new paradigm to instrumental activities monitoring with the objective of providing new tools for the clinical evaluation of the impact of the disease on persons with dementia. In this paper, we introduce the current state of the development o...
متن کامل